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PSSMs producing significant alignments: 



Score E 
(bits) value 



i44 



gnllCDDI5069 pfam033 1 1 , Comichon, Comichon protein 



142 le-35 



^^^^^".^^^^v^s^•.w.•v^-,\vww^v^vvvv^s^s^^^■^■^.-,^ 



gnHCDD15Q69 . pfam03311, Comichon, Comichon protein. 



Query: 6 
Sbjct: 1 



CD-Length = 130 residues, 89.2% aligned 
Score = 142 bits (360), Expect = le-35 

AAFCYMLALLLTAflLIr^^-'AlWHIIAFDELKTDYKNPTDQCNTLNPL^^^^ 6 5 

AAFCFIITLILNCALIFFQIYFVIAYADLETDYINPIDQCRRLt^RLVLPEYGL^:^^ 60 



Query: 66 FLCAAEWL T LGL NM PL LAY HI WRY MS R P vl4SG PGL YD PTT I MNAD I LAYVQKEGV^^^^ 122 
Sb j ct : 61 FLLSGHWFMFLLNLPLLAYNIWRYMKRPv^MSHP-L YDPTTIFNRDTLKSCQKES'WIK 116 
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BLASTP 2.2,6 [Apr.09.2003] 



Reference ; 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs". Nucleic Acids Res. 25:3389-3402. 

RID: 1055020477-012902-31232 
Query= 

(144 letters) 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,442,039 sequences; 463,592,631 total letters 

If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 

Taxonomy reports 



Distribution of 36 Blast Hits on the Query Sequence 



Mouse-over to show defline and scores. Click to show alignments 
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qi|l7137G46|ref INP 477068, l| 
qil 67 53470 I reflNP, 034050.1 1 
qi|22749039|ref |NP 689708. l| 
qil 13386310|re£|NP 082684.1 



Sequences producing significant alignments: 

qil 5031639 1 ref I NP„0057 67.1 1 cornichon-like [Homo sapiens] >. . 
qi I 27545445 I ref |NP , 034049.11 cornichon homolog [Mus musculu. . 
qil4894209lablAAD323 01.l| cornichon-like protein [Homo sapi . . 
oil 13195570|qb|AAK157 62.ll AF329820_1 cornichon [Boltenia vi . . 

cornichon CG5855-PB [Drosophil. 
cornichon-like [Mus musculus] >. 
hypothetical protein FLJ38993 . 
RIKEN cDNA 2 9 0007 5G0 8 [Mus mus. 
qii 1705957lsp|P52159lCNI DROVI CORNICHON PROTEIN 
qil 31238175 I ref iXP„319722.1 I ENSANGP00000019460 [Anopheles . 
qil 2 8207917 I emb I CAD62 612 . 1 | unnamed protein product [Homo s. 
qill7564146lref iNP 506278. l| Cornichon protein (16.8 kD) [C. 
qil2533682l|pir| IA89261 protein T09E8.3 [imported] - Caenor. 
qi|20853677lref IXP 136930. l| similar to cornichon homolog [ . 
qil27 679416|ref |XP_213960.1| similar to RIKEN cDNA D530030D. 
qil7 661824lref |NP^054903.1| HSPC163 protein [Homo sapiens] . 
qil8778623|qblAAF79631.llAC025416„5 F5011.7 [Arabidopsis th. 

qi|2083 2303lref |XP„129 629.1| expressed sequence AI647760 [M. 
qi I 6321384|ref |NP^011461.1| ER-derived vesicles; Ervl4p [Sa. 
qi|l8391380lref iNP 563903. l| hypothetical protein [Arabidop. 
qi I 9294117 I dbi I BAB01968 . 1 I gene„id : T23B7 . 14-pir | |T06616^sim. 
qi I 6319 687 I ref I NP„0097 69 . 1 1 Putative ER vesicle protein wit. 

putative er-derived vesicles p. 
expressed protein [Arabidopsis. 
expressed protein [Arabidopsis. 
hypothetical protein [Guillard. 
expressed protein [Arabidopsis. 
hypothetical protein [Neurospora. 



Score E 
(bits) Value 



qi|l9115420|ref iNP 594508.1 
qi I 15230426 I ref |NP 187825.1 
qi|30682747|ref |NP 172701.2 
qi|l3812029|ref |NP 113186.1 
qi 1 15221614 | ref \ NP .17 6476 . 1 
qi 128920672 |qb|EAA30033 .1 I 



qil25336823lpirl ID96653 hypothetical protein F16P17.3 [impo. 



ENSANGP00 00 0 010 644 [Anopheles 
hypothetical protein [Arabidop. 

hypothetical protein [Homo sapi. 
putative er-derived vesicles p. 

CG17262-PA [Drosophila melanog, 
^^niSl^Iili^^ F5011.il [Arabidopsis t. 

qi|l7570449lref iNP 508859. ij Predicted CDS, cornichon prote. 



qi|31238868|ref jXP 319866.1 
qi 1 15234412 I ref I NP . 192946.1 

qi |12053 3 55 |emb|CAB6 6 864.1 I 
qi|l9115569|ref iNP 594657.1 

qi|24581337|ref iNP 608745.1 



205 

202 
191 
164 

151. 

147 

144 

139 
139 
13 8 
12 9 

122 
115 

79 



54 
53 

3.1 
"49 
43 
48 

,.i2 
"47 
46 
.45 
44 
__44 
"44 
44 
44 
43 



41 

40 

36. 
„36. 
36 



le-52 m 

6e-52 H 
2e-48 
2e-40 



2e-36 

3e-35 

2e-34 

9e-33 
le-32 
le-32 
6e-30 

le-27 
9e-26 



8e-15 m 
2e-07 ;H 



3e-07 
6e-07 

le-06 
le-05 
2e-05 
3e-05 
4e-05 
4e-05 
le-04 
2e-04 
4e-04 
4e-04 
4e-04 
4e-04 
5e-04 
8e-04 

0.003 
0.007 

0.082 
0.095 

0.11 



Alignittents 




rj>qi| 5031639lref iNP 005767. l| H cornichon-like [Homo sapiens] 

g:U.mZ47 Slire f 1.}^ H similar to cornichon-like [Homo sapiens] [Rattus n 

qi 1 6225176 I sp| 095406 | CNIH_HUMAN ffl Cornichon homolog (TGAM77) 

qi I 40 63709 |qb| AAC983 88 .1 1 ffl cornichon [Homo sapiens] 

gi 14454684 lsbJ_MMPJAOA.LL O cornichon protein [Homo sapiens] 
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qi I 12832170 I dbj | BAB21993 . 1 
qi I 12847169 I dbj |BAB27463 . 1 



I unnamed protein product [Mus musculus] 
i unnamed protein product [Mus musculus] 
qi I 2 202830 0 |qb|AAH348 68. 1 1 S cornichon homolog [Mus musculus] 
Lengtli = 144 



Score = 205 bits (521), Expect = le-52 

Identities = 116/144 (80%), Positives = 116/144 (80%) 

Query: 1 MAFTFAAFGYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 60 

MAFTFAAFCYM WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 
Sbjct: 1 MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 60 

Query: .61 FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGW 12 0 

FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAY QKEGW 
Sbjct: 61 FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYCQKEGW 120 

Query: 121 CKXXXXXXXXXXXXXGMIYVLVSS 144 

CK GMIYVLVSS 
Sbjct: 121 CKLAFYLLAFFYYLYGMIYVluVSS 144 



□ > qi I 27545445 I ref I NP...034049 . 1 I S corniclion liomolog [Mus musculus] 
qi I 6225177 I sp I 035372 I CNIH_MQUSE ffl CORNICHON HOMOLOG 
qi|246043 0 |qb|AAC15 82 8.1 1 ffl corniclion [Mus musculus] 
Length - 144 

Score = 202 bits (515), Expect = 6e-"52 

Identities = 115/144 (79%), Positives = 115/144 (79%) 

Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 6 0 

MAFTFAAFCYM WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 
Sbjct: 1 MAFTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 60 

Query: 61 FFCVMFLCAAEWLTLGLNMP£.LAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGW 120 

FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMS PGLYDPTTIMNADILAY QKEGW 
Sbjct : 61 FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSAPGLYDPTTIMNADILAYCQKEGW 12 0 

Query: 121 CKXXXXXXXXXXXXXGMIYVLVSS 144 

CK GMIYVLVSS 
Sbjct: 121 CKLAFYLLAFFYYLYGMIYVLVSS 144 



□ >gi [48942 09igbJ.MPl.l^^^^^^^^ corniclion-lilce protein [Homo sapiens] 

Length = 134 

Score = . 191 bits (485), Expect = 2e-48 

Identities = 105/119 (88%), Positives = 105/119 (88%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 
Sbjct: 16 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 75 

Query: 8 6 IWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVSS 144 

IWRYMSRPVMSGPGLYDPTTIMNADILAY QKEGWCK GMIYVLVSS 
Sbjct: 76 IWRYMSRPVMSGPGLYDPTTIMNADILAYCQKEGWCKLAFYLLAFFYYLYGMIYVLVSS 134 
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n> gi| 13195570|gb|AAK15762.1 |AF329820_1 cornichon [Boltenia villosa] 
Length = 144 

Score = 164 bits (415), Expect = 2e-40 

Identities = 87/143 (60%), Positives - 101/143 (70%) 

Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 60 

MAFTFAAFCY+ W I lAFDELKTDYKNPIDQCN+LNPLVLPEY IH 

Sbjct: 1 MAFTFAAFCYIAAIILTAFLIFFAIWQIIAFDELKTDYKNPIDQCNSLNPLVLPEYAIHI 60 

Query: 61 FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGW 12 0 

FF ++FL EW+T+ LNMPL+ Y++WRY+ RPVMS PGLYDPTT+MNAD+L Y +EGW 
Sbjct : 61 FFTLLFLWGGEWVTVALNMPLIGYNVWRYLHRPVMSAPGLYDPTTVMNADVLTYCMREGW 12 0 

Query: 121 CKXXXXXXXXXXXXXGMIYVLVS 143 

CK MIYVLV+ 
Sbjct: 121 CKLAFYLISFFYYLYRMIYVLVT 143 



□ > gi I 17137046 I ref |NP_477 068 > 1 | & cornichon CG5855-PB [Drosophila melanogaster ] 

gi 12458458 5 Iref |NP_72 3 959.1 1 ffl cornichon CG5855-PA [Drosophila melanogaster] 
gi 1 1705956 | sp| P49858 I CNI_DROME Cornichon protein 

gi 1 1079061 Ipir I IA56724 cni protein - fruit fly (Drosophila melanogaster) 
gi i 88 67 69 I gb| AAA86527 . 1 | IS cni gene product 

gi i 72 87965 jgb I AAF45003 . 1 1 AE003 406_2 08 11 symbol =cni; synonyTO=BG : DS02740 . 13 ; match= 
score: '■ 1000.0 , desc: ' 'GenBank: : U2 80 69 : cni FBgn0000339 
SWISS-PROT: P49858 U28069:93. .527 , species: ' 'Drosophila 
melanogaster ' ' ; match=method: ' ' sim4 * ' , score : ' ' 1000 . 0 ' ' , 
desc: ' 'GenBank: : U28069 : Drosophila melanogaster cornichon 
(cni) mRNA, complete cds . CDS:93..527; PID : g886769 . ' ' , 
species: ' 'Drosophila melanogaster' ' ; 
match=method: ' 'BLASTX' ' , version: ' ' 2 . 0al9MP-WashU 
[05-Feb-1998] [Build sol2.5-ultra 01:47:30 
05-Feb-1998] score: ' '264.0' ' , desc:'> 

gi |72 98291 |gb|AAF5 3 521 . 1 | S CG5855-PA [Drosophila melanogaster] ■ 

gi |72 98292 |gb|AAF53 522.1 I M CG5855-PB [Drosophila melanogaster] 
Length = 144 



Score = 151 bits (381), Expect = 2e-36 

Identities - 78/144 (54%), Positives = 96/144 (66%) 



Query: 


1 


MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 


60 






MAF F AF Y+ +H+IAFDELKTDYKNPIDQCN+LNPLVLPEYL+H 




Sbjct : 


1 


MAFNFTAFTYIVALIGDAFLIFFAIFHVIAFDELKTDYKNPIDQCNSLNPLVLPEYLLHI 


60 


Query: 


61 


FFCVMFLCAAEWLTLGLNiyiPLLAYHIWRYMSRPVMSGPGLYDPTTIiyiNADILAYVQKEGW 


120 






F ++FL EW +L +N+PL+AYHIWRY +RPVMSGPGLYDPTT++ D L +EGW 




Sbjct : 


61 


FIjNLLFLFCGEWFSLCINIPLIAYHIWRYKNRPVMSGPGLYDPTTVLKTDTLYRNiy[REGW 


120 


Query: 


121 


CKXXXXXXXXXXXXXGiyilYVLVSS 144 








K GM+Y L+S+ 




Sbjct : 


121 


IKLAVYLISFFYYIYGMVYSLIST 144 
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□ > qi I 6753470 I ref I NP_034050 ,1 1 M cornichon-like [Mus rausculus] 

qi I 220 62540 I ref I XP_17 0 654 . 1 1 similar to cornichon-like protein [Homo sapiens] 

qi I 6225178 I sp I 035089 I CNIL„MOUSE !^ CORNICHON-LIKE PROTEIN 

qi|4521254|dbi |BAA21746.2 | S cornichon-like protein [Mus rausculus] 
qi I 2 883 8013 |gb| AAH47 953 . 1 1 Unknown (protein for MGC: 50896) [Homo sapiens] 
Length = 160 

Score = 147 bits (371), Expect = 3e-35 

Identities = 83/159 (52%), Positives = 93/159 (58%), Gaps = 16/159 (10%) 

Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQ 44 

MAFTFAAFCYM WHI I AFDEL+TD+KNPIDQ 

Sbjct : 1 MAFTFAAFCYMLTLVLCASLIFFVIWHIIAFDELRTDFKNPIDQGNPARARERLKNIERI 60 

Query: 45 CNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPT 104 

C L LV+PEY IH FC+MFLCAAEW+TLGLN+PLL YH+WRY RP +YD 
Sbjct : 61 CCLLRKLWPEYSIHGLFCLMFLCAAEWVTLGLNIPLLFYHLWRYFHRPADGSEVMYDAV 120 

Query: 105 TIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVS 143 

+IMNADIL Y QKE WCK M+Y LVS 

Sbjct: 121 SIMNADILNYCQKESWCKLAFYLLSFFYYLYSMVYTLVS 159 



□ > qi I 227 49 03 9 I ref | NP_6 897 08 , 1 | M hypothetical protein FLJ38993 [Homo sapiens] 
gi I 27805431 | sp | Q8TBS1 1 CNI L„HUMAN ffl Cornichon-like protein 

qi|l849080l| qb| AAH227 80 . 1 1 S Similar to RIKEN cDNA 2900075G08 gene [Homo sapiens] 

qi I 217 5577 9 | db j |BAC047 60.l| iH unnamed protein product [Homo sapiens] 

qi I 2 63 2 9545 |dbj |BAC2 8 511 . 1 1 ffl unnamed protein product [lyius musculus] 
Length = 160 

Score = 144 bits (364), Expect 2e-34 

Identities = 85/160 (53%), Positives = 93/160 (58%), Gaps = 16/160 (10%) 

Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNT 47 

MAFTFAAFCYM WHIIAFDEL+TD+K+PIDQCN 
Sbjct: 1 MAFTFAAFCYMLSLVLCAALIFFAIWHIIAFDELRTDFKSPIDQCNPVHARERLRNIERI 60 

Query: 48 LNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPT 104 

L LVLPEY IH+ FC+MFLCA EWLTLGLN+PLL YH WRY P S YDP 
Sbjct : 61 CFLLRKLVLPEYSIHSLFCIMFLCAQEWLTLGLNVPLLFYHFWRYFHCPADSSELAYDPP 12 0 

Query: 105 TIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVSS 144 

+MNAD L+Y QKE WCK MIY LVSS 

Sbjct: 121 WMNADTLSYCQKEAWCKLAFYLLSFFYYLYCMIYTLVSS 160 



□ > gi 1 133 86310 I ref |NP_082 684 .1 1 M RIKEN cDNA 2900075G08 [Mus musculus] 



gi 1 12 851288 I dbj |BAB2 899 6 , 1 1 unnamed protein product [Mus musculus] 

Length =17 9 

Score = 139 bits (349), Expect = 9e-33 

Identities = 84/179 (46%), Positives = 94/179 (52%), Gaps = 35/179 (19%) 
Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNP 5 0 
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MAFTFAAFCYM WHIIAFDEL+TD+K+PIDQCN ++ 

Sbjct: 1 MAFTFAAFCYMLSLVLCAALIFFAIWHIIAFDELRTDFKSPIDQCNPVHARERLRNIERI 60 

Query: 51 LVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 8 5 

LVLPEY IH+ FC+MFLCA EWLTLGLN+PLL YH 
Sbjct : 61 CFLLRKVRGVPPGGRRKGRRERGQQLVLPEYSIHSLFCIMFLCAQEWLTLGLNVPLLFYH 12 0 

Query: 86 IWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVSS 144 

WRY P S YDP +MNAD L+Y QKE WCK MIY LVSS 

Sbjct: 121 FWRYFHCPADSSELAYDPPWMNADTLSYCQKEAWCKLAFYLLSFFYYLYCMIYTLVSS 179 



□ > gi I 1705957 | sp| P52159 | CNI„DROVI CORNICHON PROTEIN 
Length = 144 

Score = 139 bits (349), Expect = le-32 

Identities = 78/144 (54%), Positives = 96/144 (66%) 

Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 60 

MAF F AF Y+ +H+IAFDELKTDYKNPIDQCN+LNPLVLPEYL+H 
Sbjct : 1 MAFNFTAFTYIVALIGDAFLIFFAIFHVIAFDELKTDYKNPIDQCNSLNPLVLPEYLLHL 60 

Query: 61 FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGW 12 0 

F ++FL EW +L LN+PL+AYHIWRY +RP+MSGPGLYDPTT++ D L +EGW 
Sbjct : 61 FLNLLFLFCGEWYSLCLNIPLIAYHIWRYKNRPLMSGPGLYDPTTVLKTDTLFRNLREGW 120 

Query: 121 CKXXXXXXXXXXXXXGMIYVLVSS 144 

K GM+Y L+S+ 

Sbjct: 121 IKLAVYLISFFYYIYGMVYSLIST 144 



□ > qi I 312 3 817 5 I ref I XP_319722 . 1 | ENSANGP00000019460 [Anopheles gambiae] 
gii3 017427 6 |gb|£AA147 88.2 | ENSANGP00000019460 [Anopheles gambiae str. PEST] 
Length = 127 

Score = 138 bits (348), Expect = le-32 
Identities = 68/97 (70%), Positives = 82/97 (84%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+H+IAFDELKTDYKNPIDQCN-t-LNPLVLPEY +H F ++FL + EWL+L LN+PL+AYH 
Sbjct : 18 FHVIAFDELKTDYKNPIDQCNSLNPLVLPEYGLHILFNLLFLFSGEWLSLALNIPLIAYH 77 

Query: 8 6 IWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGWCK 122 

IWRY +RPVMS PGLYDPT+IMN D+L +EGW K 
Sbjct: 78 IWRYANRPVMSQPGLYDPTSIMNTDVLRACLREGWIK 114 



n > qi I 28207917 | emb I CAD62 612 . 1 1 unnamed protein product [Homo sapiens] 
Length '= 94 

Score = 129 bits (325), Expect = 6e-30 
Identities = 71/85 (83%), Positives = 71/85 (83%) 

Query: 3 FTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF 62 

FTFAAFCYM WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF 
Sbjct: 1 FTFAAFCYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF 60 
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06/07/2003 



.RID=1055020477"012902-3I 



Page 7 of 15 



Query: 63 CVMFLCAAEWLTLGLNMPLLAYHIW 87 

CVMFLCAAEWLTLGLNMPLLAYHIW 
Sbjct: 61 CVMFLCAAEWLTLGLNMPLLAYHIW 85 



□ > qi 1 17 5 64146 Iref |NP._50 6278, 1| 8 Cornichon protein (16.8 kD) [Caenorhabditis elec 
Qi I 20141964 I spj Q22361 I YFR3_CAEEL Hypothetical 16.8 kDa protein T09E8.3 in chroinos 
gi I 7507545 Ipir I |T2475Q hypothetical protein T09E8.3 - Caenorhabditis elegans 
gi| 5824608 |emb|CAB01516 . 2 | Hypothetical protein T09E8.3 [Caenorhabditis elegans] 
Length = 145 



Score = 122 bits (305), Expect = le-27 

Identities - 66/144 (45%), Positives = 87/144 (60%) 



Query: 


1 


MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 


60 




MAFTFAAFCY+ + +1 DEL+TDYKNPI+QC LN L+LPEY+IH 




Sbjct : 


1 


MAFTFAAFCYLLALIAVGFCIFFAIYTVICVDELRTDYKNPIEQCRNLNQLILPEYIIHG 


60 


Query: 


61 


FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGW 


120 




F V+F+ + + +++ N+PL YHI+ Y RPVMSGPG+YDPTTI-t-N L+ + W 




Sbjct : 


61 


TFTVLFIFSWQLISILANLPLAFYHIYTYAKRPVMSGPGIYDPTTILNRSTLSSTLRISW 


120 


Query : 


121 


CKXXXXXXXXXXXXXGMIYVLVSS 144 








K MIY LV+S 




Sbjct : 


121 


IKLAFYLVSFFYYLYAMIYTLVTS 144 





□ > qil2533682l|pir| |A89261 protein T09E8.3 [imported] - Caenorhabditis elegans 
Length = 136 

Score = 115 bits (289), Expect = 9e-26 

Identities = 60/122 (49%), Positives = 80/122 (65%) 

Query: 1 MAFTFAAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHA 6 0 

MAFTFAAFCY+ + +1 DEL+TDYKNPI+QC LN L+LPEY+IH 

Sbjct: 1 MAFTFAAFCYLLALIAVGFCIFFAIYTVICVDELRTDYKNPIEQCRNLNQLILPEYIIHG 60 

Query: 61 FFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGW 120 

F V+F+ + + + + + N+PL YHI+ Y RPVMSGPG+YDPTTI+N L+ + W 
Sbjct: 61 TFTVLFIFSWQLISILANLPLAFYHIYTYAKRPVMSGPGIYDPTTILNRSTLSSTLRISW 120 

Query: 121 CK 122 
K 

Sbjct: 121 IK 122 



n > gi I 20853677 | ref j XP_.13 693 0 . 1 1 ^ similar to cornichon homolog [Mus musculus] 
Length = 246 

Score =79.3 bits (194), Expect = 8e-15 

Identities ^ 39/80 (48%), Positives = 44/80 (55%), Gaps = 19/80 (23%) 

Query: 7 AFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMF 6 6 

AFCYM WHI+AF+E+K DYK+PIDQC+TLNP 
Sbjct: 18 6 AFCYMLALLLITVLSFFIIWHILAFNEMKNDYKSPIDQCDTLNP 22 9 
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Query: 67 LCAAEWLTLGLNMPLLAYHI 8 6 

EWLTLGLNMP LAYHI 
Sbjct: 230 PEWLTLGLNMPPLAYHI 246 



□> gi|27 679416 Iref |XP_2139 60.1 I S similar to RIKEN cDNA D530030D03 [Mus musculus] 
Length = 139 

Score =55.1 bits (131), Expect = 2e-07 

Identities = 36/118 (30%), Positives = 56/118 (47%), Gaps = 1/118 (0%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+ II +L+ DY N C+ LN V+PE + H F V+ L + W+ LN+P+ ++ 
Sbjct : 22 YFIITLSDLECDYINARSCCSKLNKWVIPELVGHTFVTVLMLVSLHWVIFLLNLPVATWN 81 

Query : 8 6 IWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVS 143 

I-fR++ P G++DPT I N L KE K MI L + + 

Sbjct: 82 lYRFIMVP-SGNMGVFDPTEIHNRGQLKSHMKEAMIKLGFYLLCFFMYLYSMILALIN 138 



n > gi I 7 661824 | ref | NP„054903 . 1 | HSPC163 protein [Homo sapiens] 

gi I 12229837 | sp | Q9P003 | H163_HUMAN Protein HSPC163 

gi I 6841548 jgb I AAF29127 . 1 1 AF161512_1 M HSPC163 [Homo sapiens] 

gi 1 12 65359 5 I gb I AAH00573 . 1 | AAHQ0 57 3 HSPC163 protein [Homo sapiens] 

Lengtli = 139 

Score = 54.3 bits (129), Expect = 3e-07 

Identities = 36/118 (30%), Positives - 54/118 (45%), Gaps = 1/118 (0%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVL.PEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+ II +L+ DY N C+ LN V+PE + H V+ L + W LN+P+ ++ 
Sbjct : 2 2 YFIITLSDLECDYINARSCCSKLNKWVIPELIGHTIVTVLLLMSLHWFIFLLNLPVATWN 81 

Query: 8 6 IWRYMSRPViyiSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVS 143 

I+RY+ P G+ + DPT I N L KE K MI L+ + 

Sbjct : 82 lYRYIMVP-SGNMGVFDPTEIHNRGQLKSHMKEAMIKLGFHLLCFFMYLYSMILALIN 13 8 



□ > gi I 877 8 623 | gb | AAF79 631 , 1 | AC02 5416_5 F5011.7 [Arabidopsis thaliana] 
Length = 455 

Score = 53.1 bits (126), Expect = 6e-07 

Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 5/83 (6%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+H++ +L+ DY NP D + +N +VLPE+++ ' CV +L W L +P L Y+ 
Sbjct : 15 FHLVCLADLEFDYINPYDSASRINSWLPEFIVQGVLCVFYLLTGHWFMTLLCLPYLYYN 74 

Query: 86 IWRY]y[SRPVMSGPGLYDPTTIMN 108 

Y R L D T I N 

Sbjct: 75 FHLYSKR QHLVDVTEIFN 92 
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□ > gi I 20832303 I ref I XP_129 629 > 1 1 S expressed sequence AI647760 [Mus musculus] 



gi|l2862150| dbj IBAE3 23 6 6 . 1 | Q unnamed protein product [Mus musculus] 
Length = 139 

Score =52.8 bits (125), Expect = le-06 

Identities = 35/118 (29%), Positives = 54/118 (45%), Gaps = 1/118 (0%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 8 5 

+ II +L+ DY N C+ LN V+PE + H V+ L + W LN+P+ ++ 
Sbjct : 22 YFIITLSDLECDYINARSCCSKLNKWVIPELVGHTIVTVLMLVSLHWFIFLLNLPVATWN 81 

Query: 8 6 . IWRYMSRPVMSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVS 143 

I4-R++ P G++DPT I N L KE K MI L+ + 

Sbjct : 82 lYRFIMVP-SGNMGVFDPTEIHNRGQLKSHMKEAMIKLGFYLLCFFMYLYSMILALIN 13 8 



□ > gi I 63 213 84 I ref | NP„0 11461 . 1 1 ER-derived vesicles; Ervl4p [Saccharomyces cerevisj 
qi I 1723832 | sp | P53173 | ERV4_YEAST ER-DERIVED VESICLES PROTEIN ERV14 
gi I 2132531 Ipir I | S64058 probable membrane protein YGL054c - yeast (Saccharomyces 
cerevisiae) 

gi I 1322 550 I emb | CAA9 67 5 6 , 1 1 ORF YGL054c [Saccharomyces cerevisiae] 
Length = 138 

Score = 49.3 bits (116), Expect = le-05 
Identities = 22/58 (37%), Positives = 36/58 (62%) 

Query: 2 9 lAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYHI ' 86 

I + +L+ DY NPI+ C+ +N L+ PE +H ++FL W LN+P+LAY++ 
Sbjct: 25 ILYADLEADYINPIELCSKVNKLITPEAALHGALSLLFLLNGYWFVFLLNLPVLAYNL .82 



G > qi I 18391380 I ref | NP_5 63 9 03 . 1 1 hypothetical protein [Arabidopsis thaliana] 
Length = 118 

Score =48.5 bits (114), Expect = 2e-05 
Identities = 22/67 (32%), Positives = 36/67 (53%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+H++ +L+ DY NP D + +N +VLPE+++ CV +L W L +P L Y+ 
Sbjct: 15 FHLVCLADLEFDYINPYDSASRINSWLPEFIVQGVLCVFYLLTGHWFMTLLCLPYLYYN 74 

Query: 8 6 IWRYMSR 92 
Y R 

Sbjct: 7 5 FHLYSKR 81 



□ > gi I 9294117 I dbj | BAB01968 . 1 1 gene_id: T23B7 . 14--pir | | T0661 6-similar to unlcnown prot 
[ Arab i dops i s tha 1 i ana ] 
Length =166 

Score =47.8 bits (112), Expect = 3e-05 
Identities = 20/67 (29%), Positives = 36/67 (53%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 8 5 
+ +1 +L+ DY NP + +N LV+PE+++ C++FL W+ + +P+ YH 
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Sbjct: 24 YQVICLTDLEADYLNPFETSTRINRLVIPEFILQGSLCLLFLLTWHWVFFLVAVPVTVYH 83 

Query: 8 6 IWRYMSR 92 
Y R 

Sbjct: 84 AMLYKER 9 0 



□ > ail 6319 687 | ref I NP_0097 69 , 1 1 Putative ER vesicle protein with similarity to Erv] 

[Saccharoinyces cerevisiae] 
qi I 586332 | sp | P38312 | YB60„YEAST Hypothetical 16,3 kDa protein in DUR1,2-AME1 inter 
qi I 626236 Ipirl |S46084 probable membrane protein YBR210w - yeast ( Saccharomyces 
cerevisiae ) 

qi I 536592 I emb | CAA85174.. 1 1 ORF YBR210w [Saccharomyces cerevisiae] 
Length = 142 

Score =47,4 bits (111), Expect = 4e-05 
Identities = 22/57 (38%), Positives - 34/57 (59%) 

Query: 29 lAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

I + +L+ DY N 1+ C +N L +PE ++ AF +FL W LN+P+LAY+ 
Sbjct: 28 ILYGDLEADYINSIELCKRVNRLSVPEAILQAFISALFLFNGYWFVFLLNVPVLAYN 84 



G > qi I 19115420 I ref I NP„594508 . 1 1 putative er-derived vesicles protein [ Schizosacchc 
qi I 622 6447 I sp I Q1403 8 I YEY5_SCHPQ Hypothetical protein C2C4.05 in chromosome I 
gi i7490394|pir| |T38516 cornichon homolog - fission yeast (Schizosaccharomyces pom 
gi I 2414616 | emb| CAB16365 . 1 \ putative er-derived vesicles protein [Schizosaccharomy 
Length = 134 

Score =47.4 bits (111), Expect = 4e-05 

Identities =31/101 (30%), Positives = 51/101 (50%), Gaps = 5/101 (4%) 

Query: 6 AAFCYMXXXXXXXXXXXXXXWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVM 65 

+A+ Y + + + +LK D+ NPID LN VLPE AF ++ 

Sbjct : 3 SAWIYFTSLMLTCANIMLQMYFTVMYSDLKDDFINPIDLSRKLNWYVLPEMGFQAFSALL 62 

Query: 6 6 FLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTI 106 

L + W+T LN+P+LA++ ++ +MS 4-+D TTI 

Sbjct: 63 LLLSGAWITFLLNVPMLAWN AKMIMSNTHMHDSTTI 98 



□ > qi 1 1523 042 6 I ref i NP_187 82 5 . 1 1 expressed protein [Arabidopsis thaliana] 
qi I 12322050 |gb|AAG51073,l I AC069472„13 unknown protein; 8145-9251 [Arabidopsis tha 
qi I 3 0017219 |gb|AAP12843 ,1 1 At3gl2180 [Arabidopsis thaliana] 
Length = 146 

Score =45.8 bits (107), Expect = le-04 
Identities = 20/67 (29%), Positives = 36/67 (53%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+ +1 +L+ DY NP + +N LV+PE+++ C++FL W+ + +P+ YH 

Sbjct: 24 YQVICLTDLEADYLNPFETSTRINRLVIPEFILQGSLCLLFLLTWHWVFFLVAVPVTVYH 83 

Query: 8 6 IWRYMSR 92 
Y R 

Sbjct: 84 AMLYKER 90 
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□ > qi I 3 0682747 | ref I NP_172701 . 2 | expressed protein [Arabidopsis thaliana] 
qi 128416577 |gb|AA042819 .1 1 Atlgl2390 [Arabidopsis thaliana] 
Length = 137 

Score =45.1 bits (105)/ Expect = 2e-04 
Identities = 21/67 (31%), Positives = 35/67 (52%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 8 5 

+ 4-+ +L+ DY NP D + +N +VLPE+ + + CV +L W L +P L Y+ 
Sbjct: 23 YQLVCLADLEFDYINPYDSASRINSWLPEFIVQGVLCVFYLLTGHWFMTLLCLPYLYYN 82 

Query: 8 6 IWRYMSR 92 

• Y R 
Sbjct: 83 FHLYSKR 8 9 



□ > cTi I 13812 029 1 ref j NP_113 18 6 . 1 1 hypothetical protein [Guillardia theta] 
gi 1 13 81208 6 | ref | NP_113 1 60 . 1 1 hypothetical protein [Guillardia theta] 
gi i 13 812215 | ref | NP„1133 4 6 . 1 | hypothetical protein [Guillardia theta] 
gii25396794|pir| |G90096 hypothetical protein orfl60 [imported] - Guillardia theta 
nucleomorph 

gi|25396978|pir| |H90129 hypothetical protein orfl60 [imported] - Guillardia theta 
nucleomorph 

gii25397012 Ipirl |B90133 hypothetical protein orfl60 [imported] - Guillardia theta 
nucleomorph 

gi |'l37 94354 |gb| AAK3 9731 . 1 1 AFQ83 031_88 hypothetical protein [Guillardia theta] 
gi 1 13 7 94404|gb|AAK3 97 81 . 1 I AF083 031_145 hypothetical protein [Guillardia theta] 
gi 1 13 7 94527 I gb | AAK3 9 902 . 1 1 AF165 818_110 hypothetical protein [Guillardia theta] 
Length = 160 

Score = 44.3 bits (103),, Expect = 4e-04 
Identities = 20/53 (37%), Positives = 31/53 (58%) 

Query: 3 3 ELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+L TD NP+-I- C+ +N L +PEYL H F + F+ W+ LN P + Y+ 
Sbjct: 38 DLSTDTVNPVEVCDKVNQLKVPEYLAHLFLSIAFVIRGWWIVGFLNFPFIFYN 90 



□ > gi 1 15221614 I ref I NP_17 647 6 . 1 1 expressed protein [Arabidopsis thaliana] 
gi I 26452314 | dbj |BAC43243 . 1 1 un]cnown protein [Arabidopsis thaliana] 
gi I 2 8416891 1 gb| AA042 97 6 . 1 1 Atlg62880 [Arabidopsis thaliana] 
Length = 137 

Score = 44.3 bits (103), Expect = 4e-04 
Identities = 22/64 (34%), Positives = 35/64 (54%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+ +1+ DY NP D + +N +VLPE ++ F CV +L W L +P L Y+ 

Sbjct: 23 YQLISLADLEFDYINPYDSASRINFWLPESILQGFLCVFYLVTGHWFMALLCVPYLYYN 82 

Query: 8 6 I WRY 89 . . 

Y 

Sbjct: 83 FHLY 8 6 
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G > gi I 2892 0 672 |gb|EAA3 0033 . 1 1 hypothetical protein [Neurospora crassa] 
Length =13 8 

Score =43.9 bits (102), Expect = 4e-04 

Identities = 38/114 (33%), Positives = 55/114 (48%), Gaps = 7/114 (6%) 

Query: 2 9 lAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYHIWR 88 

I + +L+ DY NPID CN LN ++PE +H F +FL WL h LN+PLLA+++ 
Sbjct: 27 IMYSDLECDYINPIDLCNRLNTYIIPEVAVHGFLTFLFLINGYWLPLVLNLPLLAWNV-- 84 

Query : 89 YMSRPVMSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLV 142 

+ ++ L D T I L +KE + K MI L-f 

Sbjct: 85 KKIVDNAHLLDATEIFRK--LNVHKKESFTKLGFHLILFFFYLYSMIVALI 133 



□ > gi I 25336823 |pir I |D96653 hypothetical protein F16P17.3 [imported] - Arabidopsis 
gi I 8493 59 5 |qb| AAF7 5818 . 1 I AC011000_21 Contains similarity to a 14KDa protein found 
vesicles from Saccharomyces cerevisiae gi| 6321384. 
ESTs gb|T22150, gb|AI100633, gb|AA395672 come from this 
gene. [Arabidopsis thaliana] 
Length = 12 6 

Score =43.9 bits (102), Expect = 4e-04 
Identities = 22/64 (34%), Positives = 35/64 (54%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+ +1+ +L+ DY NP D + +N +VLPE ++ F CV +L W L +P L Y+ 
Sbjct: 23 YQLISLADLEFDYINPYDSASRINFWLPESILQGFLCVFYLVTGHWFMALLCVPYLYYN 82 

Query: 8 6 IWRY 8 9 
Y 

Sbjct: 83 FHLY 86 



□ > gi I 312 3 88 68 |ref |XP_3198 6 6. 1 j ENSANGP00000010644 [Anopheles gambiae] 
gi I 213 025 62 |gb|EAA147 07 . 1 | ENSANGP00000010644 [Anopheles gambiae str. PEST] 
Length = 165 

Score =43.5 bits (101), Expect = 5e-04 

Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 1/83 (1%) 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLl^PLLAYH 85 

+++I +L+ DY N + C+ LN +P+ HAF + L W+ +N+P++ + 
Sbjct : 33 YYLIILSDLECDYLNAQECCSKLNFWSIPKLAAHAFLTFVLLIHGHWMLCLVNLPMVGWL 92 

Query: 8 6 IWRYMSRPVMSGPGLYDPTTIMN 10 8. 

++ P G+YDP I N 

Sbjct: 93 VYEQYRVPA-GNIGIYDPAEIHN 114 



□ > gi 1 152 34412 | ref | NP„192 946 . 1 1 hypothetical protein [Arabidopsis thaliana] 
gi |7485571 |pir| |T06616 hypothetical protein F16J13.160 - Arabidopsis thaliana 
gi I 4586114 I emb | CAB40950.. 1 I putative protein [Arabidopsis thaliana] 
gi I 7267910 | emb.l CAB7S252 . 1 1 putative protein [Arabidopsis thaliana] 
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Length = 145 



Score = 42,7 bits (99), Expect = 8e-04 
Identities = 20/60 (33%), Positives = 33/60 (55%) 

Query: 33 ELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYHIWRYMSR 92 

+ L+ D NP D + 4-N +VLPE+ + C+ ++ W L+ + P L Y+I YM R 
Sbjct: 40 DLEFDRINPYDVSSRINRMVLPEFGLQGLLCLYYILTGHWFMAVLSLPHLFYNIRLYMKR 99 



□ > ai 1 12053355 | emb| CAB66864 . 1 | B hypothetical protein [Homo sapiens] 
Length = 92 

Score =40.8 bits (94), Expect = 0.003 • 
Identities = 21/63 (33%), Positives = 34/63 (53%) 

Query: 26 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLCAAEWLTLGLNMPLLAYH 85 

+ II +L+ DY N C+ LN V+PE + H V+ L + W LN+P+ ++ 
Sbjct: 22 YFIITLSDLECDYINARSCCSKLNKWVIPELIGHTIVTVLLLMSLHWFIFLLNLPVATWN 81 

Query: 8 6 IWR 8 8 
I+R 

Sbjct: 82 lYR 84 



G > gi |l9115569|ref| NP_594 657 . 1 1 putative er-derived vesicles protein similar to ye 

[Schizosaccharomyces pombe] 
gii7768496| emb j CAB90792 . 1 1 putative er-derived vesicles protein similar to yeast 
[Schizosaccharomyces pombe] 
Length = 141 

Score =39.7 bits (91), Expect = 0,007 
Identities = 18/32 (56%), .Positives = 23/32 (71%) 

Query: 28 IIAFDELKTDYKNPIDQCNTLNPLVLPEYLIH 59 

+1 F +L+ DY NPID CN LN LV+PE + H 
Sbjct: 31 VIiyiFSDLEI^DYINPIDLCNKLNDLVMPEIISH 62 



>^lili5JA117i.r^^^^^ S CG17262-PA [Drosophila melanogaster ] 

gi 1729 5853 I gb I AAF51153 . 1 1 ffl CG17262-PA [Drosophila melanogaster] 
Length = 157 

Score =36,2 bits (82), Expect = 0.082 

Identities = 25/87 (28%), Positives = 46/87 (52%), Gaps - 1/87 (1%) 

Query: 26 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCViyiFLCAAEWLTLGLlSnyiPLLAYH 85 

+ +++ +L+ DY N + C LN V+P++ HA CV+ L W+ LN+P++ + 
Sbjct: 2 5 YYVLTLADLECDYLNAQECCRRLNFWVIPKFGSHALLCVLLLLGGHWViyiFLLNLPMVIWL 84 

Query: 8 6 IWRYMSRPVMSGPGLYDPTTIMNADIL 112 ' 

+ + R G+YDP I + +L 

Sbjct: 85 FYE-LHRQRRDSLGVYDPVDIHSRGLL 110 
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□ > gi I 8778 625 |gb| AAF7 9 633 . 1 1 AC02 5416_7 F5011.il [Arabidopsis thaliana] 
■ Length = 110 

Score =36.2 bits (82), Expect = 0.095 
Identities = 14/42 (33%), Positives = 26/42 (61%) 

/ 

Query: 2 6 WHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFL 67 

+ +4- +L+ DY NP D + +N 4-VLPE+++ CV +L 
Sbjct: 23 YQLVCLADLEFDYINPYDSASRINSWLPEFIVQGVLCVFYL 64 



G > gi |l7570449|ref| NP_5088 59 . 1 | II Predicted CDS, cornichon protein [Caenorhabditis 
gi 1 1377 5541 1 gb| AAK3 93 49 . 1 1 Hypothetical protein Y64H9A.1 [Caenorhabditis elegans] 
Length = 5 6 

Score =35.8 bits (81), Expect =0.11 

Identities = 20/51 (39%), Positives = 26/51 (50%) 

Query: 94 VMSGPGLYDPTTIMNADILAYVQKEGWCKXXXXXXXXXXXXXGMIYVLVSS 144 

+M GPG+YDPTTI+N L+ + W K MIY LV+S 

Sbjct: 1 MMPGPGIYDPTTILNRSTLSSTLRISWIKLAFYLVSFFYYLYVMIYTLVTS 51 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

Posted date: Jun 6, 2003 1:54 AM 
Number of letters in database: 463,592,631 
Number of sequences in database: 1,442,039 

Lambda K H 

0.330 0.143 0.492 

Gapped ■ 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 113,816,301 

Number of Sequences: 1442039 

Number of extensions: 4147288 

Number of successful extensions: 8257 

Number of sequences better than 10.0: 37 

Number of HSP's better than 10.0 without gapping: 37 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 8216 

Number of HSP's gapped (non-prelim): 38 

length of query: 144 

length of database: 463,592,631 

effective HSP length: 120 

effective length of query: 24 
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effective length of database: 290,547,951 
effective search space: 6973150824 
effective search space used: 6973150824 
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